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Abstract. 

We prove that the scale map of the zero-crossings of almost all signals filtered by the second 
derivative of a gaussian of variable size determines the signal uniquely, up to a constant 
scaling and a harmonic function. Our proof provides a method for reconstructing almost 
ail signals from knowledge of how the zero-crossing contours of the signal, filtered by a 
gaussian filter, change with the size of the filter. The proof assumes that the filtered signal 
can be represented as a polynomial of finite, albeit possibly very high, order. An argument 
suggests that this restriction is not essential. Stability of the reconstruction scheme is briefly 
discussed. The result applies to zero- and level-crossings of linear differential operators 
of gaussian filters. The theorem is extended to two dimensions, that is to images. These 
results are reminiscent of Logan’s theorem. They imply that extrema of derivatives at 
different scales are a complete representation of a signal. 
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1. Introduction 


Images are often described in terms of “edges", that are usually associated with the zeros 
of some differential operator. For instance, zero-crossings in images convolved with the 
laplacian of a gaussian have been extensively used as the basis representation for later 
processes such as stereopsis and motion (Marr, 1982), In a similar way, sophisticated 
processing of 1-D signals requires that a symbolic description must first be obtained, in 
terms of changes in the signal. These descriptions must be concise and, at the same time, 
they must capture the meaningful information contained in the signal. 

it is clearly important, therefore, to characterize in which sense the information in an image 
or a signal is captured by extrema of derivatives. 

Ideally, one would like to establish a unique correspondence between the changes of 
intensity in the image and the physical surfaces and edges which generate them through 
the imaging process. This goal is extremely difficult to achieve in general, although it 
remains one of the primary objectives of a comprehensive theory of early visual processing. 

A more restricted class of results, that does not exploit the constraints dictated by the signal 
or image generation process, has been suggested by work on zero-crossings of images 
filtered with the laplacian of a gaussian. Logan (1977) had shown that the zero-crossings of 
a 1-D signal ideally bandpass with a bandwidth of less than an octave determine uniquely 
the filtered signal (up to scaling). The theorem has been extended—only in the special case 
of oriented bandpass filters—to 2-D images (Poggio, et al., 1982; Marr, et al., 1979) but it 
cannot be used for gaussian filtered signals or images, since they are not ideally bandpass. 
Nevertheless, Marr et al. (1979) conjectured that the zero-crossings maps, obtained by 
filtering the image with the second derivative of gaussians of variable size, are very rich in 
information about the signal itself (see also Grimson, 1981; Marr and Hildreth, 1980; Marr, 
1982; for multiscale representations see also Crowley, 1982 and Rosenfeld, 1982 also for 
more references). 

More recently, Witkin (1983) (see also Stansfield, 1980) introduced a scale-space description 
of zero-crossings, which gives the position of the zero-crossing across a continuum of scales, 
i.e., sizes of the gaussian filter (parametrized by the o of the gaussian). The signal—or the 
result of applying to the signal a linear (differential) operator—is convolved with a gaussian 
filter over a continuum of sizes of the filter. Zero- or level- crossings of the filtered signal are 
contours on the x — a plane (and surfaces in the x,y,a space). The appearance of the scale 
map of the zero-crossing—an example is shown in Figure 1—is suggestive of a fingerprint. 
Witkin has proposed that this concise map can be effectively used to obtain a rich and 
qualitative description of the signal. Furthermore, it has been proved in 1-D (Babaud et al, 
1983; Ytiille and Poggio, 1983) and 2-D (Yuille and Poggio, 1983) that the gaussian filter is 
the only filter with a “nice’’ scaling behavior, i.e., a simple behavior of zero-crossing across 
scales, with several attractive properties for further processing. In this paper, we prove a 
stronger completeness property: the map of the zero-crossing across scales determines the 
signal uniquely for almost all signals (in the absence of noise). The scale maps obtained 
by gaussian filters are true fingerprints of the signal. Our proof is constructive. It shows 
how the original signal can be reconstructed by information from the zero-crossing contours 
across scales. It is important to emphasize that our result applies to level-crossings of any 
arbitrary linear (differential) operator of the gaussian, since it applies to functions that obey 
the diffusion equation. 

Our fingerprints theorems can be regarded as an extension of Logan’s result to gaussian 
filtered, nonbandpass signals and 2-D images. There are. however, some important 
differences between Logan’s Theorem and the fingerprints theorems. Logan uses a 
bandpass filter, at one scale only, and shows that the zero-crossings determine the filtered 
signal. His proof is non-constructive and only applies in 1-D (2-D generalizations exist 
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Figure 1 The scale map of the zero-crossings of the second derivative of a signal (a 1-D 
slice of a natural image). The x axis is the abscissa; the scale, i.e. a, increases from the 
bottom to the top. Our theorem states that this map is a true fingerprint since it determines 
uniquely the signal (modulus the null space of the operator). 


[Poggio et al, 1982] but none are fully satisfactory). The fingerprints theorems determine 
the original image from the zero-crossings of the image filtered at different scales. The 
proof is constructive and applies in both 1-D and 2-D. Reconstruction of the signal is of 
course not the goal of early signal processing; Symbolic primitives must be extracted from 
the signals and used for later processing. Our results imply that scale-space fingerprints 
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are complete primitives, that capture the whole information in the signal and characterize it 
uniquely. Subsequent processes can therefore work on this more compact representation 
instead of the original signal. 

Our results have theoretical interest in that they answer the question as to what information 
is conveyed by the zero- and level-crossings of multiscale gaussian filtered signals. From 
a point of view of applications, the results in themselves do not justify the use of the 
fingerprint representation. Completeness of a representation (connected with Nishihara’s 
sensitivity) is not sufficient (Nishihara, 1981). A good representation must, in addition, be 
robust (i.e. stable in Nishihara's terms) against photometric and geometric distortions (the 
general point of view argument). It should also possibly be compact for the given class of 
signals. Most importantly it should make explicit the information that is required by later 
processes. Fingerprints of images may have these additional properties. Their compactness 
property, for instance, can be defended with the same type of heuristic arguments used to 
justify edge detection. 1 


2. Assumptions and results 

We consider the zero-crossings of a signal I(x), space-scale filtered with the second 
derivative of a gaussian, as a function of x , a. Let F and E be defined by 





E(x, a) = 


iL 

dx 2 


I *G 


d 2 


E{x, a) = I(x ) * ~ [G(z, o-)] 




d 2 _ 

dx 2 


exp 




d$. 


Notice that E(x, a) obeys the diffusion equation in x and a: 


[ 2 . 1 ] 


d 2 E IdE 
dx 2 a da' 


[ 2 . 2 ] 


We restrict ourselves to images, or signals, P such that E can be expressed as a finite 
Tayior series of arbitrarily high order. Observe that any filtered image can be approximated 
arbitrarily well in this way. 

We will show that the local behavior of the zero-crossing curves (defined by E[x,a) — 0) on 
the x — a plane determines the image up to an harmonic 2 function <p(x), such that — 0. 
The proof of this result will then be generalized to 2-D. We will also discuss its (obvious) 
extension to zero- and level-crossings of linear (differential) operators. More precisely we 
will prove the following theorem: 

Theorem 1: The derivatives (including the zero-order derivative) of the zero-crossings 
contours defined by E(x,a) — 0, at two distinct points at the same scale, determine uniquely 

J Clearly, the scale map fingerprint cannot always be a more concise description of the signal than 
the signal itself, unless the signal is redundant in precisely the way that the fingerprint representation 
can exploit. We expect this to be the case for images, if an appropriate differential operator is 
used, because images are not a purely random array of numbers. Usually images consist of rather 
homogeneous regions that do not change much over significant scale intervals. 

2 Th:s indeterminancy is not a problem. It has long been known that the human visual system is rather 
insensitive to linear illumination gradients. Our reconstruction scheme provides the Laplacian of the 
image I in terms of Hermite polynomials. It is easy to integrate a function of this type to obtain I. 
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a signal of class P up to an harmonic function of x and constant scaling (except on a set 
of measure zero). 3 

Note that the theorem does not apply to signals that do not have at least two distinct 
zero-crossings contours. Another remark is relevant here: the gaussian filter seems critical 
for our proof, but we cannot show that it is the only filter with this property. In section 4 
we will extend Theorem 1 to the two dimensional case: 

Theorem 2: Derivatives of the zero-crossings contours, defined by E(x,y,cr) — 0, at two 
distinct points at the same scale, uniquely determine an image of class P up to an harmonic 
function of x, y and a scaling factor (except on a set of measure zero). 

If the signal is not a polynomial, a similar weaker result can be proved. 4 A best solution 
can always be found but it may not be unique. These theorems break down when all the 
zero-crossing contours are independent of scale (i.e. the contours go straight up in the 
scale-space fingerprint). This is a rare, though interesting, special case and is discussed 
in detail in a future paper [Yuille and Poggio 1983, in preparation]. It can only occur for 
functions which cannot be represented as finite polynomials. 

The theorems do not directly address the stability of this reconstruction scheme. The first 
question concerns stability of the reconstruction of the filtered function E(x,o) at the a 
where the derivatives are taken. Note that our result relies only on two points on the 
zero-crossing contours. Exploitation of the whole zero-crossings contours should make 
the reconstruction considerably robust. The second question is about the stability of the 
recovery of the unfiltered signal V 2 J(i) from E(x,a). This is equivalent to inverting the 
diffusion equation, which is numerically unstable. Reconstruction is, however, possible with 
an error depending on the signal to noise behaviour (see later). 5 

2.1. Outline of the 1-D Proof 

We summarize here the 1-D proof from a slightly different point of view that clarifies its bare 
structure. 

The proof starts by taking derivatives along the zero crossing contours at a certain point. 
Such derivatives split into combinations of x and t derivatives (where t — a 2 / 2 )- Because 
the filter is assumed to be gaussian, however, derivatives can be expressed in terms of x 
derivatives. This is a key point: since the filtered signal E(x, t) satisfies the diffusion equation, 
the t derivatives can be expressed in terms of the x derivatives simply by E t — E xx . The next 
stage is to find the x derivatives of E(x,t) up to an arbitrary degree n from such derivatives 
along the zero crossing contours in the x — t plane. We show that this can be done by 
using 2 points on 2 contours. (It is possible that one point is sufficient, but we are as yet 
unable to prove this.) Since E(x,t) is entire analytic, because of the gaussian filtering (see 
Appendix 2), it can be represented by a Taylor series expansion in x. Since we know the 
values of the n derivatives of E(x,t) with respect to x, we know its Taylor series expansion 
and hence E(x,t). The unfiltered signal F(x), (E(x,t) — F(x)*G(x,t)) can then be recovered 

3 For a general operator the reconstruction is modulus the null space of the operator. Harmonic 
functions are the null space of the Laplacian. 

4 In this case, the signal is determined by the zero-crossing contours in the L 2 sense only. This 
means that the signal may not be determined correctly on a set of measure zero. However, if the 
image is assumed to be an analytic entire function (see Appendix 2), section 3.4 implies that it can be 
determined exactly everywhere. Since images - like any physical signal - are effectively bandlimited by 
the measurement (or imaging) process, they can be considered as restrictions to the reals of analytic 
entire functions (see Appendix 2). 

5 lf E(x, a) is obtained at all a - this can be done by applying the reconstruction scheme at two 
points at each a - robust reconstruction of I(x) can be achieved in the following way (Hummel and 
Zucker, in prep.). Since / E(x, t)dt = / (1* G) xx dt — f (I*G),dt — E(x, o) with the integration between 
0 and oo, 1 (x) can be reconstructed by integrating E(x,a) across a (with t — a 2 /2 the diffusion 
equation is E x x — E t ). In practice, the limits of integration will be finite, originating small errors in the 
reconstruction. 
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in the ideal noiseless case by deblurring the gaussian. A particularly simple way of doing 
this is provided by a property of the function <t> n in which we will expand the function F: the 
coefficients of an expansion of F(x) in terms of <p„ are equal to the coefficients of the Taylor 
series expansion of E(x,t). In the presence of noise, the recovery of F(x) from E{x,t) is 
obviously unstable, it is limited by S/N ratio since high spatial frequencies in the signal are 
masked by the noise for increasing t. (For instance, if F(x) — Yl a v ei>lx < the filtered signal 
is E(x,t) = £ a M e !>z e—Note that since the zero-crossing contours are available at all 
scales a reconstruction scheme that exploits more than 2 points will be significantly more 
robust, As one would expect, the reconstruction of the unfiltered signal is therefore affected 
by noise. The reconstruction of the filtered signal E(x,t) is likely to be considerably more 
robust. We plan to study theoretically and with computer simulations the noise sensitivity of 
the reconstruction scheme. 


3. Proof of the Theorem in 1-D 

We divide our proof into three main steps. In the first we show that derivatives at a point on 
a zero-crossing contour put strong constraints on the "moments" of the Fourier tranform 
of E(x,a) (see eq. 3.1.4). The second section relates the "moments" to the coefficients of 
the expansion of F(x) = E(x, 0) in functions related to the Hermite polynomial, in the third 
section we show that the "moments" can be uniquely determined by the derivatives on a 
second point of a different zero-crossing contour. 

3.1, The "moments" of the signal are constrained by the zero-crossing contours 

~ —a 2 

Let the Fourier tranform of the signal I(x) be Z(w) and the gaussian filter be G(x,a ) — £e'5? r 
with Fourier transform G(w) = e~‘ * . 

The zero crossings are given by solutions of E(x, t) = 0. Using the convolution theorem we 
can express E(x, t) as 


E(x, t) 




e ,ux u 2 I(w)du. 


[3.1.1] 


and t = a 2 ! 2. The Implicit Function theorem gives curves x(t ) which are C°° (this is a 
property of the gaussian filter and of the diffusion equation, see Appendix 2 and Yuille and 
Poggio, 1983). Let f be a parameter of the zero crossing curve. Then 


d dx d dt d 

d$ d$ dx d$ dt 


[3.1.2] 


On the zero-crossing surface, E — 0 and ^E = 0 for all integers n. Knowledge of the 
zero crossing curve is equivalent to knowledge of all the derivatives of x and t with respect 
to ?. 

We compute the derivatives of E with respect to f at ( x 0 ,t 0 ). The first derivative is : 


-~E(x, f) = -7~ f e w t e ,UJX (iw)u) 2 l(ijj)duj 
d$ d( J 


dt 

d$ 


J e- wSt (-w 2 )e iwx w 2 /(w)dw 


[3.1.3] 


and is expressed in terms of the first and second moments of the function e~ w2i e iux u 2 I(u). 
The moment of order n is defined by: 
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M n 


£ 


(iw) n e-“ t e l ^ x u 2 I[u))du>. 


The second derivative is 

j2 j 2 r 

-^E(x,t)=— J e~“ : '^{iwyiMdw 

(fit f 

+ ^ J e~“ 2 t (—u> 2 )e tux u 2 I(u)duj 

+ (j: j J e-“ af e iwi (- w 2 ) w 2 7( w )(ia; 

+ 2 — — J e~~' " 4 (—w 2 )e' WI (iw)w 2 7 (w)tfw 

(dt\ 2 f a 

+ ( J e~“ t (oj i )e~'“ x u J 2 I{u)doj. 


[3.1.4] 


[3.1.5] 


Since the parametric derivatives along the zero crossing curve are zero, equation [3.1.3] is a 
homogeneous linear equation in the first two moments. Similarly, [3.1.5] is a homogeneous 
linear equation in the first four moments. In general, the n th equation, ^E{x,t) = 0, 
is a homogeneous equation in the first 2n moments. We choose our axes such that 
X 0 = 0. The next section shows that the moments of e-“ ;2i w 2 /(w) are the coefficients a n 
in the expression of the function F(x) in Hermite polynomials. So we have n equations 
in the first 2 n coefficients a n . To determine the a n uniquely, we need n additional and 
independent equations which, as we will show in section 3.3, can be provided by considering 
a neighboring zero crossing curve at {x u t 0 ). 

3.2. The "moments" are the coefficients of the expansion of F(x) 

In this section we show that the moments defined by [3.1.4] can be related to the coefficients 
of the expansion of F(x) in functions related to the Hermite polynomials. We expand 


F[x) = I 3 - 2 - 1 ! 

in terms of the functions ip n [x, <?) related to the Hermite polynomials H n {x) (see Appendix 1) 
by 


<f>n[x,a) = (—l) n — . . ) 

(v^)™+i0F Vv/2a/ 

F l X ) = Z) a n(<r)<Pn(x, O’) 

71 = 0 

The coefficients a n (a) of the expansion are given by 


[3.2.2] 

[3.2.3] 


a n (cr) = (w n (x, a), F(x)} [3.2.4] 

v^here (,) denotes inner product in L 2 and {w n (x,a)} Is the set of functions biorthogonal to 
{<p n (x,o-)}. The {ip n (x,a)} are given explicitly by 


<Pn[x,<r) — 


t 2 n—1 


i\s/2V^ 


-e*° 2 


d n 

dx n 


e 


[3.2.5] 


6 
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and the w n (x,a) by 


Since 


d n —i 2 


W*) = 


— I e iux u 2 I(u)du 


[3.2,6] 


[3.2.7] 


the a n are given by 




a n (a) = 



d n 

-—e 
dx n 


, e iw *)w 2 /(w)dw 


[3.2.8] 


The inner product in [3.2.8] is just the inverse Fourier transform of w n (x): Therefore, 



[3.2.9] 


which is equal to M n modulus a factor e iwx . We will need to consider the derivatives along 
the zero-crossing contours at two points. We can choose coordinates so that these points 
are (0,a o ) and (xi,cr 0 ). 

Therefore knowledge of the image is equivalent to knowing the a n . 


3.3. Combining information from two contours 

The derivatives at {xi,t 0 ) give us n equations in the first 2n moments of e -w2t e' w:Cl w 2 /(w). 
We can relate them to the expansion coefficients of the function 


F(x + 



e ‘"*e <u “w a /(«)dw 


[3.3.1] 


in terms of the tp n functions. 
We write 


2 n 

F(x + xi) = Yl b n (p n (x) 


0 


We have n equations for the 2 n unknowns b n . Now observe that 


[3.3.2] 


2 » 2 n 

b n <p n (x) = a n ( Pn(x + Xi) [3.3.3] 

0 0 

Any ip n (x + z,) can be expressed as a linear combination of <p m {x) with m < n, as we will 
show in section 3.3.1. 

Thus we express each 6 n ’s in terms of a„’s and then we combine the equations from two 
points to obtain 2 n equations for the 2n coefficients a n . Thus with the results of the next 
two sections the proof will be complete. 


7 
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3.3.1. Change of basis 

For a given a, the functions are given in terms of Hermite polynomials (see eq. 3.2.2). 
We show how we can express Hermite polynomials with origin at x — 0 in terms of Hermite 
polynomials with origin at x = — x x and hence perform a change of basis. 

We show that H n (x — x } ) can be expressed as a linear combination of H„{x) with m < n. 
Let us consider the generating function e ~ p ' + ' 2px which defines the Hermite polynomials as 


e —P a +2pz 


CO 


= E 


Hn(x) 
n! P 


Equation [3.3.1.1] gives at x + xi 


[3.3.1.1] 


g — p 2 +2p(x— ii) __ H n (x X l) pn 


n — 0 


n! 


The left hand side of equation [3.3.1.2] can be expanded as 

,-p*+2px p -2px 1 = Hn ^ p n jr (~ 2x ) mp 


_ n! m\ 

n—Q m—0 


Term by term comparison of equation [3.3.1.2] and [3.3.1.3] gives 

H n (x-x 1 )= jr ( n )H m (x)(-2 Xl ) n ~ m 

m= 

The series obtained by substituting equation [3.3.1.4] into 

oo 

f(oo) — b nH n (x — £j) 
n—0 

is a series of the form 

CO 

2 d n H n (x) 


n=0 


[3.3.1.2] 


[3.3.1.3] 


[3.3.1.4] 


[3.3.1.5] 


[3.3.1.6] 


that converges to f(x) as the following argument demonstrates. If f(x) is in L 2 it can be 
expanded in terms of the H n . Similarly, g(x ) = f(x — Xl ) is also in L~ and can be expanded 
as 


oo 

9( X ) = X) C n H ni x ) [3.3.1.7] 

n=0 

Thus, f(x — xi) can be expressed as a linear combination of H r (x) and this series coincides 
with eq.(3.3.1.6) because of the uniqueness of the expansion. In particular, we obtain for 
the <p n 




ip n {x — X U t ) — <p n (x, t ) + Xiipn-xix, t ) + . . . [3.3.1.8] 

where the remaining terms are functions with m < n — 1, multiplied by polynomials 

in xi. 
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Observe that if we restrict ourselves to polynomial functions of x a change of basis will 
correspond to transformating an n-th order polynomial in % into an n-th order polynomial 
in x — x\. This argument suffices for our theorems which are restricted to finite polynomial 
functions of class P. For the sake of completeness, however, Appendix 3 deals with the 
general case. 

3.4. Independence of the equations 

We have to show that information from 2 points yield a unique solution. The first n equations 
in the 2 n first moments from a point can be written as 



( M x {x)\ 

M 2 (x) 

[M 2 n (x)J 


(3.4.1) 


The matrix of the coefficients is a nx2n matrix. Note that its rows are linearly independent 
(since the coefficients of the r ih row vector are zero after the 2 r th component). 

The next n equations are given by the matrix of the derivatives at a second point, x it that 
have the same form as eq. (3.4.1), multiplied by the moments at (x -f x x ). 




/“"’N 


(f 

£ 


0 

0 

\ dc;* 

sin. 

d $ 2 

+(?r 

2f$ 

«) 


0 

0 


... 0 

... 0 


f Mjfz-Fzi)'! 

M 2 (z + fci) 


= 0. 


\M 2 „(z + zi)/ 


(3.4.2) 


The moments at (z + zO can be expressed in terms of the moments at z by the following 
transformation (see section 3.3). 


x\ X s . z 2 "\ 

1 X 1 T 3T 27iT 

( Mi(z) \ 


( M x (x -f- zj) ^ 

0 1 xi ... 

M 2 (z) 


M 2 (x + zj) 

0 0 1 Z! ... 

! 

= 

: 

1 1 

\M 2n {x)J 


^M 2n (x -f- X X ); 


Equation (3.4.3) substituted into (3.4.2) gives, together with equation 3.4.1, the full set of 2n 
equations in the 2n unknowns M,(x). The 2 n x 2 n matrix of the coefficients can be thought 
of as originating from the first point (the top half) and from the second point (the bottom 
half) on the zero-crossing curves. 

In general, the determinant of this matrix is non-zero. Intuitively, if the filtered signal has 
nonzero moments of order higher than 2n, the system of 2n equations would not have a 
solution. A proof for this claim is given in Appendix 4. The argument is based on the fact 
that the determinant of the coefficients is a polynomial in x x . If this vanishes, then x x can 
be expressed in terms of the first n derivatives at the two points. We show, however, that 
in general it is possible to change x continuously without altering the first n derivatives. 
This implies that the determinant is almost always different from zero. The argument breaks 
down if the filtered signal is a polynomial of degree 2n or less. 

In this case, the determinant must be zero, since the homogeneous set of equations has 
at least one solution. At this point, we have to show that the solution is unique. We first 
observe that the determinant of the coefficients of the 2 n x 2 n system of equations is a 
polynomial in x x . This polynomial is nontrivial 0 since the first n and the second n equations 

6 This argument cannot be applied when all zero-crossing contours aie vertical straight lines: in this 
case it is impossible to reconstruct the signal [Yuille and Poggio 1983, in preparation] 
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separately are independent. It follows that the determinant vanishes at a finite number (at 
most 2n) of values of x x . 

Suppose the determinant is zero. Observe that is known from the position of the zero¬ 
crossing curves (x { is the distance between the two points at which derivatives are taken). 
Typically the roots of the polynomial in x, will be distinct and there will be a unique zero 
eigenvector of the matrix. Thus we have proved that n derivatives at two points determine 
uniquely (modulus a common scaling factor) the 2n moments of a polynomial of degree 2 n. 
The case of multiple zero eigenvectors is nongeneric, i.e., an arbitrarily small perturbation 
in the “image" would annihilate eventual multiple zero eigenvectors. Furthermore, multiple 
zero eigenvectors of the matrix of degree 2n must also be multiple zero eigenvectors of all 
higher order matrices which is even more unlikely (except on a set of measure zero). 

Our proof is limited to filtered function of the polynomial type (albeit of very high degree). 
We now stretch an argument suggesting that the result holds also for most filtered functions 
E(x,y) which are not polynomials. 

Consider the homogeneous system of equations obtained from two points up to the moment 
M 2 „. Denote with A' the matrix of the coefficients. Let A be the matrix of the coefficients of 
the inhomogeneous system of equations obtained by dividing all unknowns Mj to M 2N by 
the first moment. The system, AM' — Z, where Z is the first column of A 1 divided by M u 
does not in general have solutions as we have shown (see Appendix 4). Furthermore, A 
has no null vector (if it has, then A' must also have a null vector, which is impossible since 
detA 1 o.). Then there is a unique least square solution of the equation || AM 1 — Z\\ — 0 

given by M' — A~~Z, where A + is the pseudoinverse of A [see Albert, 1972]. Thus for every 
finite M there is a unique least square solution to the system of equations AM' = A but no 
exact solution. As n goes to infinity, however, at least one exact solution must appear. 

To summarize, in section 3.1 we show that the moments of the signal are constrained by the 
derivatives of the zero-crossing contours at one point. Section 3.2 shows that the moments 
are equal to the coefficients of the expansion of the unfiltered signal F(x) in our Hermite-like 
expansion (and also equal the coefficients of the Taylor expansion of the filtered signal 
E(x,t)). In section 3.3 we show how we can combine constraints from two different points 
on the zero-crossing contours at the same scale. Finally, section 3.4 demonstrates that the 
equations obtained in this way from two points determine a unique solution. The stability of 
the solution was briefly discussed in section 2. - 


4. The Extension to Two Dimensions 

The function 


can be expanded in terms of the <p„(x,t) and <p n (y,t) as 


(4.1) 


F(x) = J2 anm(t)<Pn[x,t)<Pm{y,t) 

n,m 


(4.2) 


with the coefficients given by 


a n m(t) — (T’C?;), W n (x, t)w m (y, t)) 


J '(tw a )"(iw y )" 


-LJ t 


/(w)dw. 


(4.3) 


We define T(w) as 
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T(w) - e^’% 2 /(w)e-^ a< 

and 

E{x,t) = J T{u)dw. 

We now take derivatives of E(z, t) on the zero-crossing surface. Thus with 

d dx d dy d dt d 

df df dx df dy df dt 

the first equation 


(4.4) 


(4.5) 


(4.6) 







gives, where u = (w 2 , u y ) = (w a , w 2 ) and w 2 = W;c 2 + w y 2 , 


(4.7) 



and the second equation 


gives 


d 2 E 

df a 


= 0 


(4.8) 


(4.9) 



(4.10) 


where we use the summation convention over i,j. 

These equations, and the higher order ones that can be obtained in the same way, are 
equations in the moments S 7r _ pq where 


Smpq — J U 2n <jJ P x U q y T(uj)du. (4.11) 

The ri-th order equation will consist of terms of the type S mpq with m + p + g < n. We will 
show that, using different curves on the zero-crossing surface through (z 0 ,ya,f 0 ), we can 
produce one equation for each pair of moments with m + p-j-g = n in terms of the moments 
with m -f p 4- q < n. As in the 1-D case we have half the equations we need to solve for 
the moments. Again we can consider a second point on another zero-crossing contour (at 
the same scale). Combining the equations from two points, after a change of basis, yields 
enough information to determine the image. 
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There are an infinite number of curves on the zero-crossing surface which pass through 
any given point ( x 0 ,y 0 ,t a ). Since the surface is two-dimensional, the tangents to these 
curves form a two-dimensional vector space. Each curve will give different equations for 
the moments. We will show that by taking different curves through the same point (x 0 ,y 0 ,t 0 ) 
we can nearly obtain enough linearly independent equations to determine the moments. 
As in the one-dimensional, case we show that we can obtain the remaining information by 
considering the behaviour at a second point on the zero-crossing surfaces. 

We first consider the lowest order equation (4.8). As we mentioned above there are two 
linearly independent tangents to the zero-crossing surface at [x 0 ,y 0 ,t 0 ). Thus we have 
two linearly independent vectors ( 377 , g—-, 3 ^) and ( 377 , 377 , 377 ) where we use fi and f 2 to 
denote the parameters on the two curves. We can substitute tnese vectors into equation(4.8) 
and obtain two equations for three unknowns (the three moments with n = 1). These are 
sufficient to determine the moments up to a scaling factor. This factor corresponds to 
scaling the image I and cannot be determined from the zero-crossings. 

We now consider the n-th order equations and show that there is one equation for each pair 
of moments with p-j-g + m = n in terms of the moments with p -f- q + m < n. The moments 
we need to solve for are the S mpq with m -f- p -f- q — ». Fix m and consider the moments 
as p and <7 vary. There are n — m + 1 possible moments, however we will show that only 2 
of them are independent. The moments are given in equation (4.11). Adding the moments 
S m ,p,q and S m ,p+ 2 , 9-2 gives us the moment S m+ i, pw _ 2 . Now, since m + p + q = n, we find 
(m + 1) + p 4 (q — 2 ) = n — 1 and so the moment S m + i, P , ? _ 2 is of order n — 1 . Thus if 
we know S m<p , q we also know S m , p+ 2 , ? — 2 - We can repeat this argument adding 2 to p and 
subtracting 2 from q or vice versa. Thus if we know the moments S m ,o,n-m and 5 m ,i,n—m —1 
we can use this argument to find the other moments. Hence for each value of m there are 
only 2 independent moments. The case m = » is special and only has one term, m can 
vary from 0 to n and so there are a total of 2 n +1 independent moments with m + p-j-q = n, 
We show that for each m it is possible to get one equation for the two moments. 

The coefficients of the unknowns will be A pqm where 


A 


pqrn — 


n! f dx \ p (dyV 

m\p\ql\d( J \d$ ) \d$ J 


(4.12) 


evaluated at x 0 , yo, to and thus the expansion containing the unknowns is of the form 


^ ' A m pqS m pq (4.13) 

m,p,q 

We consider now the terms for a fixed ». Since we can take the derivatives along arbitrary 
directions on the surface, the terms dx/d$ and dy/dc take independent values (while dt/d( 
is constrained since the curve must lie on the zero-crossing surface). We will show that it 
is possible to eliminate all the new moments (those with m + p + q = n) with m > 0 and 
obtain one equation for the two independent moments with m = 0. In a similar way we 
can get one equation for the two independent moments with m = 1 and so on. First we 
eliminate the moment with m — n. We consider N curves with parameters (fi,f 2 ,---f/v)- We 
normalize the curves by requiring 


= 1, * = 1,2,.. .N (4.14) 

oft 

This is always possible unless we are at an extrema of the zero-crossing surface. Suppose 
we write down the n-th order equations corresponding to two different curves. From 
equations (4.12) and (4.13) the coefficients of 5 n , 0 ,0 are unity in both cases. Thus we can 
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subtract one equation from the other and obtain a linear equation for the new moments with 
m < n. 

Now we show we can eliminate the m = n and m = n — 1 moments simultaneously. We 
take three curves with parameters fi,? 2) f 3 and add the three resulting n th order equations 
multiplying the first by X the second by n and the third by v. Comparing coefficients of 
S n , o,o. 1 , 1,0 and S„._ lj0) i gives us three simultaneous equations 




X -f- M + v — 0) 

(4.15) 

, dx dx dx 

(4.16) 

x *7 + ^ + ^“ 0 ' 

x ^ + ^ + ‘'^ = 0 ' 
d$ 1 df 2 af 3 

(4.17) 


These can always be solved since the tangent vectors (i.e the vectors (1, -0-)) form a two 

dimensional vector space and hence any three vectors in this space are linearly dependent 
and satisfy equations (4.15), (4.16) and (4.17). Note that if two curves are the same we 
can satisfy these equations but the resulting equation for the moments will vanish. Let 
the normal to the zero-crossing surface be ( nx,n 2 ,n 3 ). Then the curves must satisfy the 
equation 

n l + n 2 ~T~ + ^3 -T~ = 0 (4.18) 

{ af i 


So we can only vary one of ^ and independently. 

Now we try to eliminate the moments with m = n, m = n — 1 and m = n — 2. We combine 
the n-th order equations from curves with parameters c, multiplying the equations by X( 
and taking the sum. The coefficients of the moments 5„ i0i o, S„_i,i,o, S n _i !0 ,i, S n — 2 , 2,0 and 
S n — 2 , 1,1 (using S n ~ 2 , 0,2 = 5 n _i !0 , 0 — 5 n _ 2t2 ,o, where lj0 ,o is a lower order moment) form 
a vector f t - 



dx 

dU 


\ 


dU 

V %% J 


(4.19) 


where, for simplicity, we have incorporated the factors of n into the moments. To eliminate 
the moments with m > n — 2 we must solve 


N 

X) = °- ( 4 - 2 °) 

t=l 

Because of equation (4.18) there are only three linearly independent vectors which can be 
formed by varying the jf- and taking linear combinations. Thus if A r = 4 we can solve 
the equations (4.20) and eliminate moments with m > n — 2, By increasing the number 
of curves by 1 each time we decrease m by 1 we can eliminate all the moments with 
m > 1. We are left with one equation in the two unknown moments with m — 0. In a 
similar way we can eliminate ail the moments except the two with m = 1 and obtain one 
equation for these two moments. We repeat this process for the higher order moments. 
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Thus the n-th order equations give us one equation for each pair of independent moments 
with p + q + m = n. If we consider all the equations as n varies we find that, as in the 1-D 
case, at each point on a zero-crossing contour we have only half the number of equations 
needed to solve for the moments. As in the one-dimensional case we now consider a 
second zero-crossing surface at the same value of t and repeat the argument. We change 
the basis of the Hermite polynomials as in section 3.3 and obtain enough equations to solve 
for the moments. Substituting these into the equation (4.2) for F{x) completes the proof of 
Theorem 2. 


5. Examples 

We now illustrate the theorems by considering some special cases. If the signal is a low 
order polynomial in X, it is possible to obtain the zero crossing curves explicitly. We then 
use the derivatives of these curves to reconstruct the image, as in the theorem. These 
examples also suggest that the derivatives of the curves at a single point will usually give 
sufficient information to reconstruct the signal. 

Suppose the signal F(X) is a second order polynomial in X. 

F{X) = 1 + AX + BX 2 (5.1) 

where A and B are arbitrary coefficients. All the moments of the signal are zero except for 
the first two. We convolve this signal with a gaussian at scale o and obtain 


E[X, a) = 1 + AX + BX 2 + Str 2 

(5.2) 

We consider the curves given by 


= 0 

(5.3) 

We write these in the form 


<r 2 + {X 2 + (A/B)X + l/S} = 0 

(5.4) 


and see that they correspond to circles in the (X, a) plane. Define X x and X 2 by 


X x X 2 = 1/S 
-[Xi+X 2 )=A/B 

Then we can rewrite the equations as 





(5.6) 


Thus the zero crossing curve corresponds to a semi-circle which intersects the X-axis at 
X x and X 2 (see figure 2). 

We now parameterize the curve by an angle 6 so that 


a(6) = 


(5-7) 
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r^. 





Figure 2 See text. 




We calculate the derivatives 


dX (X 2 - XA . 
dO ~ \ 2 J S ‘ 

( 


da (X 2 - X, 
dS 


2 ~^)co39 


2 ; 

Recalling that t = a 2 / 2, we combine (5.7) and (5.8) to obtain 


& fX 2 - X ] 
d6 


(X 2 -X i y . 

1 — 2 — j " 


sinOcosB 


We differentiate again to obtain 

d 2 X 
dO 2 




(5.8) 


(5.9) 


(5.10) 


We set — b. Then we write the first two equations at 6 = 6 X as 
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/•"V 


( —bsin8\ b 2 sin9\cos8i 0 0 

— bcos9 x b 2 cos 2 8\ — 2b 3 sin 2 8 x cos9x b' i sin 2 9xCOs 2 9 i J 

We pick another point on the curve with the same value of cr. 

$ 2 = 7T — 9 X (with 0 < 9 2 < 7 t/2 ). This gives us a second equation 



This point has parameter 


( —bsind j — b 2 sin9 x cos9 x 0 
bcos9 x 2 cos 2 9\ 2b 3 sin 2 9 x cos 2 9x 


0 

b i sin 2 9iCos 2 9\ 


(l 

Xi 

¥ 


(M x \ 

0 

1 

X, 

X\/ 2 \ 

M 2 

0 

0 

1 

Xi 

M3 

lo 

0 

0 

i / 

\mJ 


V0> 


(5.12) 


Now we consider the equation for the first two moments obtained by taking the first derivative 
at both points. From (5.11) and (5.12), this becomes 


( —bsin9\ b 2 sin6iCOs6i \ /"0\ /- 

— bsin8\ — b 2 sin9icos9i — Xibsin9x)\M2) \o) 

The condition for there to be a solution of (5.13) is that the determinant of the matrix 
vanishes. This occurs at 


Xi — — 2bcos8\ 


(5.14) 


f"*\ 


From (5.7), we see that this is indeed the distance between the two points and so we can 
solve for M x and M 2 . We obtain: 


Mj = bcos9\Mi 

Substituting for bcos9 x from (5,7) yields 

— 

where X 0 is the position of the first point. The reconstructed function is 

F(X) — —M x ipi{X — X 0 ,Oi) + M 2 <p 2 (X - X 0) cr) 
Without loss of generality set X 0 ■— 0. Then up to a scale factor 

\[2ko\ 2\phi ' 1 ' 




Now o x lies on the circle 

at the point where X — 0. Hence' 


(5.15) 


(5.16) 


(5.17) 


(5.18) 


(5.19) 


-XxX 2 


(5.20) 
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Note that X x and X 2 have opposite signs if X = 0 lies on the circle. Substituting (5.20) into 
(5.18) gives 


WX) = 


2v / 2^V'-XiX 2 


{X 1 X 2 -(X 1 +X 2 )X + X 2 } 


From (5.1) and (5.5), we see that this is indeed the original function up to a scaling factor. 
Thus we have demonstrated how to reconstruct the signal. 

We should check that X x = — 2bcosd x remains a root of the determinant for the higher order 
determinants. We will calculate the result for the case n — 2. From (5.11) and (5.12), the 
determinant equation becomes (in unconventional notation) 


bsindi b 2 sindicosd x 
—bcosdi b 2 cos 2 9 x 


-2b 3 sin 2 d x cosdi 


b 4 sin 2 9icos 2 0x 


-bsindi —Xibsindi — b 2 sin 2 d x cosdi —^- L b 2 sindicosd x — X x b 2 sin9 x co$9 x 


. bcos0 x X x bcos0 x -j- b 2 cos 2 0 x 


X 2 /2\bcos$ x +X x b 2 cos 2 9 x + 2 b 3 sin 2 0 x cos 2 9 x B 


where C — -^-bsindi —^-b 2 sind x cosdi and B = ^-bcos9 x -\-^r-cos 2 0 x -i-Xi2b s sin 2 9 x cos 2 9i-{- 
b 4 sin 2 9 x cos 2 6 x . Dividing the matrix by factors common to rows, this becomes 


—1 bcosd x 0 0 

— 1 bcos0 x — 2b 2 sin 2 9 x b 2 sin 2 0 x cosd x 

-1 -X x -bcosd x — X\/2\ — X x bcosd x -Xf/3! — X 2 l2\bcos9 x 

1 X x + bcos0 x X\/2\X x bcosQ x -\-2b 2 sin 2 0 x cos9i V 


where V = + %fbcosd x +Xi2b 2 sin 2 d x cosdi + b 3 sin 2 dicosd x . By adding and subtracting 

rows, this reduces to 


—1 bcos9 x 0 0 

0 0 — 26 2 .sm 2 0x b 3 sin 2 9 x cosdi 

0 —Xi — 2bcos9i ~X 2 /2\ - Xibcosdi —X\/3\ - X{/2\bcosdi 

0 0 2b 2 sin 2 diCOsdi S 


0 (5.24) 


where S — X x 2b 2 sin 2 9icosdi + b 3 sin 2 0icosdi. Thus, the equation becomes (removing 
common factors) 


(X. + 2 W,)J 9i ‘ Xi+4 =0 


and can be expressed as 


[Xi + 2bcosdi){2X x + b + bcosdi) = 0 (5.26) 

Hence, X x = —2bcosd x remains a root for the n = 2 case. 

We turn now to another example and an alternative approach to the problem of determining 
the image from the zero-crossings across scales. Let the signal be a third order polynomial. 
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F{X) = A + BX + CX 2 + X s (5.27) 

If we know that the signal is a third order polynomial we can determine its coefficients by 
derivatives of the zero-crossing contours at a single point. It is straightforward to show that 
this gives 


E(X, t) = A + BX + CX 2 + X 3 + 2(C + 3 X)t (5.28) 

The zero crossings curves are given by 

A + BX + CX 2 + X 3 + 2 [C + 3 X)t = 0 (5.29) 

We will show that by taking a sufficient number of derivatives at one point it is possible to 
reconstruct the signal. From (5.29), we calculate 






B + 2 CX + 3X 2 + 6t + 2(C + 3X)^ = 0 (5.30) 

dx 

2C + 6X + 2[C + 3X)~ + 12-^ = 0 (5.31) 

dx 2 dx 

6+180 + 2(C + 3X)0 = ° (5.32) 

At a point X ^$^,0,0 are known. We write (5.29), (5.30), (5.31) as equations in the 
unknowns A, B, C. 


(I X X 2 + 2t f-X 3 ~6Xt\ 

0 1 2X+2£ # = _ 3 x 2 -6t (5.33) 

VO 0 2 + 20 j\c) \-6X-12$) 

It will be possible to solve these equations uniquely for A, B, and C, provided the determinant 
of the matrix on the left hand size is nonzero. But the matrix is the Wronskian of the functions 
1, X, X 2 + 2 1. Except for a set of measure zero, which we discard, it will only vanish if 
the functions 1, X, X 2 + 2 1 are linearly dependent. But from (5.29) we see this can only 
happen if the function (C + 3X) divides the function (A + BX + CX 2 + X 3 ). Apart from 
this special case, the determinant of the matrix will be non-zero and it is possible to solve 
for A, B and C. 

We now consider the special case. The condition that (C7+3X) divides (A+BX+CX 2 +X 3 ) 
is 


and the result is 




(5.34) 


X 2 

3 


2CX 
9 + 


A 

C 


+ <r 2 =0 


(5.35) 


Note firstly that although for the general third order polynomial (5.29) there are usually three 
zero crossing curves, there are now only two. Secondly, this equation is of similar form to 
equation (5.4) for the second order polynomial but the relative coefficients of u 2 and X 2 are 
different, so the two cases can be distinguished. 
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We can differentiate (5.35) and show that the coefficients A and C can be determined at a 
single point. 

It seems likely that this result will apply to all polynomials functions F(X). Hence knowledge 
of the degree of the polynomial signal allows reconstruction from derivatives at a single 
point. It is furthermore likely that the degree of the image could be determined by the shape 
of the zero-crossing contours. If this conjecture is true this would represent an alternative 
constructive proof of Theorem 1. 


6. Conclusions 

We conclude with a brief discussion of a few issues that are raised by this paper and that 
will require further work. 

a) Stability of the reconstruction. Although we have not yet rigorously addressed the question 
of numerical stability of the whole reconstruction scheme, there seem to be various ways for 
designing a robust reconstruction scheme. The first step to consider is the reconstruction 
of the filtered signal E(x,t). One could exploit the derivatives at n points - at the given a - 
and then solve the resulting highly constrained linear equations with least squares methods. 
Alternatively, it may be possible to fit a smooth curve through several points on one contour, 
and then obtain the derivatives there in terms of this interpolated curve. The same process 
must be performed on a second separate zero-crossing contour. This scheme provides a 
rigorous way of proving that instead of derivatives at two points, the location of the whole 
zero-crossing contour across scales can be used directly to reconstruct the signal (since 
the Implicit Function theorem shows that the zero-crossing curve is C 00 ). 

The second step involves the reconstruction of the unfiltered signal 7(i). We have 
constructed V 2 I explicitly. The construction is in terms of Hermite polynomials which 
can be integrated up straightforwardly to give us I (up to a function such that V 2 <£ = 0). 
Alternatively we can consider F{x) to be the second moment of I(x) (see equation (3.2.7)) 
and then use the moment equations to determine the second and higher order moments of 
I(x) leaving the first two moments undetermined. This reconstruction step is unstable, as 
we discussed earlier, if only one scale is used. I(x) can of course be reconstructed only 
modulus the null space of the (differential) operator. When the differential operator is the 
Laplacian and E(x, t) is available for all t, then the representation is invertible and I(x) can 
be recovered. 78 

b) Degenerate fingerprints. Our uniqueness result applies to almost all signal: a restricted 
but well known class of signals, with vertical zero-crossings in the scale-space diagram, 
correspond to nonunique fingerprints. These signals, which will be discussed in a forthcom¬ 
ing paper [Yuille and Poggio 1983, in preparation], do not belong to the class P introduced 
in Theorem 1 and 2. Interestingly, level-crossings (with a level different from zero) can 
distinguish between elements of this class. Note that there is a further, obvious constraint 
on the reconstruction: the original signal I(x) can be reconstructed modulus the null space 
of the (differential) filter. 

c) Extensions. Our main results are not restricted to second derivatives. They apply to zero- 
and level- crossings of a signal filtered by a gaussian filter of variable size. They also apply 

7 This reconstruction scheme may play a role in the computation of lightness in the vertebrate visual 
system, (Poggio, possibly in preparation). 

“Notice that it may often be possible to assume that the image is an analytic entire function, i.e,, a 
distribution already “diffused" by the imaging process. In this case, the Hermite expansion converges 
analytically everywhere and so does the associated Taylor expansion. If the convergence, however, 
can only be ensured in Lr (i.e., the image is not analytic) there may be some parts of the image where 
the series expansion is not a faithful representation. We conjecture that it should usually be possible 
to infer the presence of such an anomalous region from the behavior of the derivatives of the image. 
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to transformations of a signal under a linear space-invariant operator - in particular they 
apply to the linear derivatives of a signal and to linear combinations of them. In both 1-D 
and 2-D, local information at just two points is sufficient. In practice, since many derivatives 
are needed at each point, information about the whole contour, to which the point belongs, 
is in fact exploited. 

d) Are the fingerprints redundant? The proof of our theorem implies that two points on 
the fingerprint contours are sufficient. As we mentioned earlier, several points are probably 
required to make the reconstruction robust. We conjecture, however, that the fingerprints 
are redundant and that appropriate constraints derived from the process underlying signal 
generation (the imaging process in the case of images) should be used to characterize how 
to collapse the fingerprints into more compact representations. Witkin made already this 
point and discussed various heuristic ways to achieve this goal. 

e) Implications of the results. As we discussed in the introduction, our results imply that 
the fingerprint representation is a complete representation of a signal or an image. Zero- 
and level-crossings across scales of a filtered signal capture full information about it. These 
results also suggest a central role for the gaussian in multiscale filtering that assure that 
zero- and level-crossing indeed contain full information. Note, however, that the fingerprint 
theorems do not constrain or characterize in any way the differential filter that has to 
be used. The filter may be just the identity operator, provided of course that enough 
zero-crossings contours exist. Independent arguments, based on the constraints of the 
signal formation process, must be exploited to characterize a suitable filter for each class 
of signals. For images, second derivative operators such as the Laplacian are suggested by 
work that takes into account the physical properties of objects and of the imaging process 
(Grimson, 1982; Poggio and Torre,in preparation; Yuille, 1983). We plan to explore this 
approach in the near future. 
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Appendix 1 


Properties of Hermite polynomials and truncation of the expansion 
The set of Hermite functions is defined by 



where H n are the Hermite polynomials, 


[ 1 ] 


= [ 2 ] 

The Hermite functions are an orthonormal basis of functions which is complete for L 2 
functions. The completeness is expressed by 

53 Tpn{x)tl>n{$) — S(X — f) [3] 

n 

and the orthonormality by 

J tynip^^mip^dx — S nm [4] 

In general, the Hermite expansion of a L 2 function does not converge uniformly, but only in 
the L 2 norm. The series will converge to the function except at a set of point of measure 
zero. At any point, the series can be truncated at a term of order N such that the remainder 
of the series is arbitrarily small. If we only consider a finite number of points where the. 
series converges, the series can be truncated and the function approximated arbitrarily well 
by a finite number of Hermite components. 

The Hermite polynomials defined in equation [2] and the set of function w n (x ) defined as 
(see equation 3.2.6): 


w„(x) 


1 d n _ x 2 
2 "nlv^F^” 6 


are biorthogonal sets of functions, i.e., 


J H n {x)w m ^X^dx — $nm 

They also obey a completeness property 

53 H n {x)w n (() = 6(x — f) 

nm 

and therefore a L 2 function f(x) can be expanded in either set of functions as 


/(z) = Y2 a n H n{x) 

n 

fi x ) = b n w n{x) 


[5] 


[ 6 ] 


[7] 


[3] 
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with 


Q"n — {/j Wn) 

bn = (f,H n ) 


[9] 


Appendix 2 


Analytic properties of solutions of the diffusion equation 

For completeness we provide results about the analytic properties of functions convolved 
with the gaussian, i.e. solutions of the heat equation with the Huygens property (see later). 
Notice that images can be considered to have undergone already a "diffusion", since the 
imaging process has the effect of convolving the light distribution with a point spread 
function, usually very close to a gaussian. 

Lemma : The solutions of the diffusion equations u(x,t) as functions of x for a given t are 
restrictions to reals of functions which are entire functions of the complex variable (Widder, 
1975, page 64). 9 

This result is noteworthy. The condition that u xx — u t automatically brings with it C°° for 
u(x,t) and even analyticity for the space variable x. Considered as a function of t, u{x,t ) can 
be extended analytically into the complex plane, although it will not generally be analytic 
(see Widder, 1975, page 65). 

F(x, a) — a n[x, cr)v n (x, a) 


The heat polynomials v n are defined as the coefficients of £ in the expansion of 


e xz+t ' 2 = Yl v ni x > t) 


n —0 


n! 


[ 1 ] 


where t— 3 ^. v n is a polynomial of degree n and is given by 

f n /2] n — 2k 

”- (M) “ n! 5 


[ 2 ] 


They are related to the Hermite polynomials H n by 


Vn{x,t)~(-t) n/2 H n (-2- ) [3] 

The heat polynomials are solutions of the diffusion equation. Among other nice properties, 
they obey the following theorem [Widder, 1975]. 

Theorem: Any function u(x,t) which obeys the diffusion equation and has the Huygens 
property can be expanded in a series of heat polynomials which converges uniformly in 
(M)-_ 

9 Entire functions are functions which are analytic everywhere. 
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A solution u(x,t) of the diffusion equation is said to have the Huygens property if it can be 
expressed as the convolution of the Gaussian with its initial value at t — 0. Filtered images 
satisfy this requirement by definition. 

This theorem is important since it ensures that the expansion converges not only in the 
square integrable norm, as is usually the case, but also uniformally (i.e., at each point). This 
ensures that undesirable behavior, like the Gibbs phenomenon, does not end. 


Appendix 3 


Convergence of change of basis 
We can write F(x) in terms of the Vs as 

OO 

F{x) = b n (t)<p n {x + x u t). (1) 

n=0 

This series converges in the L 2 sense: given an e > 0, however small, it is possible to find 
an Ni such that 


/"■* s 




b n {t)<p n (x + x i,f)j dx < e. 

Using (1) we can write equation (2) as 

[( b n {t)ip n (x + xi,t)\ dx < e. 

J 'n—Ni +1 ' 

Similarly for the expansion of F(x) in terms of o n ’s we can find an N% such that 


( 2 ) 


(3) 


/(*>-£ dx 


< €. 


(4) 


Set N = ma.x(Ni,N 2 ). If we can cut off the series for F(x) at N then we can change the 
basis, as in subsection 3.3.1., equate coefficients of the <p n (x,t)’s and hence relate the c„,s 
to the Vs. We would then have N equations for the first N moments. We now show that 
cutting off the series is permissible: if we choose e sufficiently small we can make the error 
involved as small as we like. 

The Vs are related to the 6 n 's by 


«»n(f) = ( b n {t)<p(x + x u t), w n {x, t) 


\n—0 


This can be written as 


N 


a n(t ) = ( y) b n (t)ip(x -f Xi, t), w n (x, t) ) ■+ ( J2 b n (t)ip(x + x u t), w n (x, t ) ). 

\n=0 / \n=N+l / 


(5) 


( 6 ) 
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The second term is called s„ and is neglected when we cut off the series at N. We must 
now show that this is justified. We can write its square as 


m 



Using the Cauchy-Schwarz inequality we obtain 



^ Y2 + *)»**) < J w n (x,t) 2 dx b n (t)(p(x + xi,t)j dx 


The first term on the right hand side of (6) is 


a n — a n = K{t)tp{x -f x u t), w n (x, t)j. 


( 8 ) 


(9) 


Hence, using (8) and (3), we have 


d n 2 < e / w n [x,t)j dx. (10) 

Thus if we make e very small the errors a n ’s will be negligible (we can scale w n (x,t) and 

<p n (x,t) by functions of n so that f^w n (x,t)j dx tends to a finite limit as n tends to oo; 

alternatively, note that both d n (x,t) and a n — a n depend linearly on w n [x,t) and so scaling 
it will not alter their relative sizes). 

The errors involved in terminating the series can be made arbitrarily small by making the 
cutoff N sufficiently large. Thus we can change the basis and obtain N equations for the 
first N moments. We solve these equations and obtain the first N terms in the expansion 
of F(x) in terms of the <p„(x,t). Taking the limit as N tends to oo we reconstruct the image 
(in the L 2 sense). 


Appendix 4 


We will show that the 2n th order determinant is generally non-zero. Recall that the 
determinant is a polynomial in x x (of degree at most 2 n) with the coefficients being functions 
of the first n derivation of the curves at the two points. If this determinant always vanished, 
it would mean that the distance between any two curves with prescribed values of their first 
n derivation could only take a finite set of values (at most 2n) whatever the values of the 
higher order derivation of the curves. We will show that, by changing the values of the 
higher order derivatives, it is possible to alter the value of X x continuously while keeping 
the first n derivation of the curves constant. 

We take two points (0, t x ) and (x u t } ) lying on zero-crossing curves. At these points, 
we assume we know the derivatives §f, ... up to order n. (This means we 

can reconstruct from the implicit function theorem.) We can use the diffusion 

equation to write these as ff, 0,..., 
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So we have 


and 


£(0,fi) = 0 

f(0,<i) = K, 


d 2n E 

d 2n x 


(0, ii) = Kin 




E{xi,ti) = 0 

^{x 1 ,t 1 ) = C l 


d 2n E 

dx 2n 


(xxjti) — Cin 



where Ki,...,K 2n and Ci,...,C 2n are specified. Now we will try to alter the value of x x 
while keeping Ki,...,K 2n and C x ,...,C 2n constant. 


We have 


E{x,t) = I t iax e- uH w 2 I{w)du 

Introduce a “deformation" parameter X and a function Y{uj, X) where 



Y(w, 0) = w 2 /(w) 


and x x = 2 i(X). 
Let 


E{x, t, X) 


/' 




Y(w,X)dw 


(4) 


(5) 


Allow zi(X) to vary while maintaining equations (1) and (2). For the first point this gives 

/ e ~ w2t ^ y (w,XMw==0 

I e-“ !t w 2n ^(w,X)dw = 0 

For the second point we obtain 


[ e lMX 'e-“ h (iu)Y{u,\)dGj+ f 

J J OA 

< r . r o 

J e iwil e-“ 2f ( ! a.)w 2n y( W ,X)rfw+ j e iul ^ f u 2n ~Y {lj, \)du = 0 

We want to solve equations (6) and (7) tor f£(w,X) in terms of Then the result follows. 

Equation (6) implies that the first 2 n moments of §£(w, X) are zero. Equation (7) means that 
the first 2n moments of e !WIJ: g^(tj,X) take prescribed values. (We assume Y(w,X) is known 
but not &y(w,X).) 
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/—\ 


/“'S 


Expanding e iuXl as a Taylor series (and using equation (6)), we write equation (7) as 


£ 

lm=2n+l J 


, ,21 (9 __< . « . dx 

—y(u, x)ju = - 


£ ( iw ) 


m 2!i m I 2 „ _ w >t d dx 




m! 




:/■ 


,Wl e- w *(iw)y(w, X)dw 


(8). 




The moments of ^y(w,X) are 


W„ 


[ e- w2t 4-y(w,X)o; m cfw 
*/ aX 


and define 

A = [ 
p d\ J 

Using (9) and (10) we rewrite equations (8) as 

(*'*i) 


e ^i e -^t{i u )uPY(u,\)du 


(9) 


( 10 ) 


£ 


m— 271+1 


ml 


-w m = Ai 


E 


(iii) 


m—2n 


(m — 2n)! 




*-2n+l 


( 11 ) 


It will always be possible to solve these equations for W m and there will be infinitely many 
solutions. To see this, we set 


and write equation (11) as 


( 


(**i) 



W m = 0 ,m>4n + l 


(4n-H)! 

fw 2n+1 \ 

2n\ / 

^.W4n+1/ 


Ai 

A 2n +1 


( 12 ) 


(13) 


It is possible to solve (13) if the determinant is non-zero. The determinant is of form 
X(i 1 )i 2n+1 ) 2 . (This follows directly from the form of the matrix) and so is either zero 
for all x x or else never zero. The determinant is also the Wronskian of the function 


2 n\ 


(»U 4n+1 

(4n-fl)! 


and as these functions are lineariy independent, it cannot vanish 
everywhere. Hence the determinant never vanishes and we can solve for the W m 's in terms 
of the A p ’s. Relaxing the condition (12) gives us infinitely many solutions. 


Thus, we have shown that it is possible to alter continuously without changing the values 
of the first n derivatives at both points. This means that the determinant of the 2n-order 
matrix in the moments will in genera! be non-zero; it can only be zero for a finite set of x x 
and there are an infinite set of possible values for x x compatible with the first n derivatives 
at the points. 
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